Multi-View Hierarchical Semi-supervised Learning by Optimal Assignment of Sets of Labels to Instances

Authors

  • Bhavana Dalvi
  • William W. Cohen
Abstract

In multiclass semi-supervised learning (SSL), information about data points is sometimes available in multiple views. In this paper we propose an optimization-based method for semi-supervised learning in the presence of multiple views. Our techniques use mixed integer linear programming formulations together with the EM framework to find consistent class assignments given the scores in each data view. We compare our techniques against existing baselines, including a co-training variant of K-Means, on a number of multi-view datasets. Our proposed techniques give state-of-the-art performance in terms of F1 score, outperforming a well-studied SSL method based on co-training. Further, we show that our techniques can be easily extended to multi-view learning in the presence of hierarchical class constraints. These extensions improve the macro-averaged F1 score on a hierarchical multi-view dataset.
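The abstract does not give the optimization itself, so the following is only a rough sketch of what "consistent class assignments given the scores in each data view" could look like, not the authors' formulation. It assumes each view supplies a score per instance and class, that the objective is the plain sum of scores over views, and that in the hierarchical case a valid label set is a node together with all of its ancestors. Under those assumptions the integer program decomposes per instance and can be solved by enumeration, so no MILP solver is used here. All function names and the `parent` map are hypothetical.

```python
from typing import Dict, List, Optional, Sequence

# Assumed layout: scores[view][instance][class]
Scores = Sequence[Sequence[Sequence[float]]]


def summed_scores(view_scores: Scores, i: int) -> List[float]:
    """Sum the per-class scores of instance i over all views."""
    n_classes = len(view_scores[0][i])
    return [sum(view[i][k] for view in view_scores) for k in range(n_classes)]


def assign_labels(view_scores: Scores) -> List[int]:
    """Flat case: exactly one class per instance.

    With only a one-label-per-instance constraint, maximising the summed
    score reduces to an argmax per instance.
    """
    labels = []
    for i in range(len(view_scores[0])):
        totals = summed_scores(view_scores, i)
        labels.append(max(range(len(totals)), key=totals.__getitem__))
    return labels


def path_to_root(node: int, parent: Dict[int, Optional[int]]) -> List[int]:
    """Return the node followed by all of its ancestors."""
    path = []
    current: Optional[int] = node
    while current is not None:
        path.append(current)
        current = parent[current]
    return path


def assign_label_sets(view_scores: Scores,
                      parent: Dict[int, Optional[int]]) -> List[List[int]]:
    """Hierarchical case (assumed constraint): a label set must be a
    node plus all its ancestors. Pick the path with the highest total."""
    candidates = [path_to_root(k, parent) for k in parent]
    result = []
    for i in range(len(view_scores[0])):
        totals = summed_scores(view_scores, i)
        best = max(candidates, key=lambda path: sum(totals[k] for k in path))
        result.append(sorted(best))
    return result


# Toy data: two views, two instances, three classes (0 is the root).
view_a = [[0.9, 0.1, 0.0], [0.2, 0.5, 0.3]]
view_b = [[0.6, 0.3, 0.1], [0.1, 0.2, 0.7]]
hierarchy = {0: None, 1: 0, 2: 0}

flat = assign_labels([view_a, view_b])
nested = assign_label_sets([view_a, view_b], hierarchy)
```

Because all toy scores are non-negative, the path objective always prefers a deeper node; with signed scores (for example log-odds) the same enumeration could also stop at an internal node. In the EM setting the abstract mentions, an assignment step like this would alternate with re-estimating the per-view models, which is not shown here.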

Similar resources

Detecting Concept Drift in Data Stream Using Semi-Supervised Classification

Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...

Multiple Instance Learning with the Optimal Sub-Pattern Assignment Metric

Multiple instance data are sets or multi-sets of unordered elements. Using metrics or distances for sets, we propose an approach to several multiple instance learning tasks, such as clustering (unsupervised learning), classification (supervised learning), and novelty detection (semi-supervised learning). In particular, we introduce the Optimal Sub-Pattern Assignment metric to multiple instance ...

Learning with Low-Quality Data: Multi-View Semi-Supervised Learning with Missing Views

The focus of this thesis is on learning approaches for what we call “low-quality data” and in particular data in which only small amounts of labeled target data is available. The first part provides background discussion on low-quality data issues, followed by preliminary study in this area. The remainder of the thesis focuses on a particular scenario: multi-view semi-supervised learning. Multi...

Semi-supervised Multi-label Learning Algorithm Using Dependency Among Labels

In this paper, we present a semi-supervised algorithm for multi-label learning by exploring the relationship among labels. Based on the accuracy, we determine the classification order for labels, a list of classifiers is trained by this order, with each classifier being trained by using the outputs of the previous classifiers in the list as additional input features. Experiments on three multi-...

A Novel Multi Label Learning Based on Clustering Integrated Ensemble Classifier Chain Micro Prediction Models

Most of the real world problems are concerned with assignment of multiple target labels to the instances. The proposed model aims to increase the accuracy by incorporating supervised and semi supervised learning. K Means clustering is employed which creates K clusters based on the initialization of cluster centroids. Datasets are clustered based on its distribution in the Euclidean space. Clust...

Publication date: 2014